
Exhibit A 

Clean Version of The Pending Claims in U.S. Patent Application Ser. No. 10/041.770 

1. (Amended) An isolated nucleic acid molecule comprising the nucleotide sequence 
of SEQIDNO:l. 

2. (Amended) An isolated nucleic acid molecule comprising a nucleotide sequence 

that: 

(a) encodes the amino acid sequence shown in SEQ ID NO:2; and 

(b) hybridizes to the nucleotide sequence of SEQ ID NO: 1 or the 
complement thereof under highly stringent conditions of 0.5 M 
NaHP0 4 , 7% sodium dodecyl sulfate (SDS) and 1 mM EDTA at 65°C 
and washing in O.lx SSC/0.1%SDS at 68°C. 

3. An isolated nucleic acid molecule comprising a nucleotide sequence that 
encodes the amino acid sequence described in SEQ ID NO:2. 

4. (Amended) An isolated expression vector comprising a nucleic acid molecule that 
comprises the nucleotide sequence of SEQ ID NO:l. 

5. (New) An isolated expression vector comprising a nucleic acid molecule that 
encodes the amino acid sequence described in SEQ ID NO:2. 

6. (New) A host cell comprising the expression vector of claim 4 or 5. 
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Exhibit A 

Clean Version of The Pending Claims in U.S. Patent Application Ser. No. 10/041.770 

1. (Amended) An isolated nucleic acid molecule comprising the nucleotide sequence of 
SEQ ID NO: 1. 

2. (Amended) An isolated nucleic acid molecule comprising a nucleotide sequence that: 

(a) encodes the amino acid sequence shown in SEQ ID NO:2; and 

(b) hybridizes to the nucleotide sequence of SEQ ID NO: 1 or the complement 
thereof under highly stringent conditions of 0.5 M NaHP0 4 , 7% sodium 
dodecyl sulfate (SDS) and 1 mM EDTA at 65°C and washing in 0.1 x 
SSC/0.1%SDS at68°C. 

3. An isolated nucleic acid molecule comprising a nucleotide sequence that encodes 
the amino acid sequence described in SEQ ID NO:2. 

4. (Amended) An isolated expression vector comprising a nucleic acid molecule that 
comprises the nucleotide sequence of SEQ ID NO:l. 

5. (New) An isolated expression vector comprising a nucleic acid molecule that encodes 
the amino acid sequence described in SEQ ID NO:2. 

6. (New) A host cell comprising the expression vector of claim 4 or 5. 
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Exhibit B 

Marked Up Version of Amended Claims in U.S. Patent Application Ser. No. 1 0/041.770 



1. (Amended) An isolated nucleic acid molecule comprising [at least 60 contiguous 
nucleotides from] the nucleotide sequence of SEQ ID NO:l. 

2. (Amended) An isolated nucleic acid molecule comprising a nucleotide sequence that: 

(a) encodes the amino acid sequence shown in SEQ ED NO: 2; and 

(b) hybridizes [under highly stringent conditions] to the nucleotide sequence of 
SEQ ID NO:l or the complement thereof under highly stringent conditions 
of 0.5 M NaHPO.. 7% sodium dodecvl sulfate (SDS^I and 1 mM EDTA at 
65°C and washing in O.lx SSC/0.1%SDS at 68°C . 

3. An isolated nucleic acid molecule comprising a nucleotide sequence that encodes 
the amino acid sequence described in SEQ ID NO:2. 

4. (Amended) An isolated expression vector comprising a nucleic acid molecule that 
comprises [at least 60 contiguous nucleotides from] the nucleotide sequence of SEQ ID NO:l. 

5. (New) An isolated expression vector comprising a nucleic acid molecule that encodes 
the amino acid sequence described in SEQ ID NO:2. 

6. (New) A host cell comprising the expression vector of claim 4 or 5. 
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Score E 

Sequences producing significant alignments: (bits) Value 

AL356356. 17. 1.176550 1386 0.0 

>AL356356 ACCESSION: AL3 56356 NID: gi 18476570 emb AL356356.17 Human DNA 
sequence from clone RP11-54A4 on chromosome 1, complete 
sequence 
Length = 176550 

Identities = 699/699 (100%) 
Strand = Plus / Plus 

Query: 433 aggtcccggcttcgagaccccatcaagccaggaatgttcggttatgggagagtgcccttt 492 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II II II II I I I I I I I I I II I I I I I I I I I I I 
Sbjct: 80686 aggtcccggcttcgagaccccatcaagccaggaatgttcggttatgggagagtgcccttt 80745 

Query : 493 gcattgccactgcaccggaaccgcaggcaccctcggagcccacccagatctgagctgtcc 552 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Sbjct: 80746 gcattgccactgcaccggaaccgcaggcaccctcggagcccacccagatctgagctgtcc 80805 
Query: 553 ctgatctcttctagaggggaagaggctattccgtcccctactccaagagcagagccattc 612 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Sbjct: 80806 ctgatctcttctagaggggaagaggctattccgtcccctactccaagagcagagccattc 80865 
Query: 613 tccgcaaacggcagcccccaaactgagctccctcccacagaactgtctgtccacacccca 672 

II HIUII III III lllilllllMIIMIIMIillllllllllllllllll 

Sbjct: 80866 tccgcaaacggcagcccccaaactgagctccctcccacagaactgtctgtccacacccca 80925 
Query: 673 tccccccaagcagaacctctaagccctgaaactgctcagacagaggtggcccccagaacc 732 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 III II II INI INI 1 1 III 1 1 1 1 II II II 1 1 II I lllllll 

Sbjct: 80926 tccccccaagcagaacctctaagccctgaaactgctcagacagaggtggcccccagaacc 80985 
Query: 733 aggcctgcccccctacggcatcaccccagagcccaggcctctggcacagagcccccctca 792 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Sbjct: 80986 aggcctgcccccctacggcatcaccccagagcccaggcctctggcacagagcccccctca 81045 
Query: 793 cccacgcactccttaggagaaggtggcttcttccgtgcatcccctcagccacgaaggcca 852 

IMIIIIIMIIIIMIMII II lllllllllllllllllllll MM III MINIMI 

Sbjct: 81046 cccacgcactccttaggagaaggtggcttcttccgtgcatcccctcagccacgaaggcca 81105 
Query: 853 agttcccagggttgggccagtccccaggtagcagggagacgccctgatccttttccttcg 912 

I II 1 1 II II II 1 1 M I M M M I II 1 1 1 M I M 1 1 M 1 1 1 1 1 1 II 1 1 M II 1 1 1 II I II I 

Sbjct: 81106 agttcccagggttgggccagtccccaggtagcagggagacgccctgatccttttccttcg 81165 



Query: 913 gtccctcggggccgaggccagcagggccaagggccttggggaacgggggggactcctcac 972 

II MINIMI I MINIUM III ! MINI III MM I M Ml ! Ml IN I Ml Ml 

Sbjct: 81166 gtccctcggggccgaggccagcagggccaagggccttggggaacgggggggactcctcac 81225 
Query: 973 gggccccgcctggagcctgaccctcagcacccgggcgcctggctgcccctgctgagcaac 1032 

i 1 1 1 1 1 1 1 1 1 1 1 II I II 1 1 1 1 1 1 1 1 1 1 1 1 1 M I i 1 1 ! II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M M 

Sbjct: 81226 gggccccgcctggagcctgaccctcagcacccgggcgcctggctgcccctgctgagcaac 81285 
Query: 1033 ggcccccatgccagctccctctggagcctctttgctcccagtagccctattccaagatgt 1092 

MM MM Ml II I llllll INI MIIIIIMI MM I II III lllllll I II MM 

Sbjct : 81286 ggcccccatgccagctccctctggagcctctttgctcccagtagccctattccaagatgt 81345 

Query: 1093 tctggggagagtgaacagctaagagcctgcagccaagcg 1131 

M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct: 81346 tctggggagagtgaacagctaagagcctgcagccaagcg 81384 



Identities = 360/360 (100%) 
Strand = Plus / Plus 

Query: 77 aggtgttgtccggacactctcttcagacacctacagaggagggccagggccccgaaggtg 136 

II III lllllll MIIIIIMI Mill I MM lllllll I II II I II MM I MM I II I 

Sbjct : 80158 aggtgttgtccggacactctcttcagacacctacagaggagggccagggccccgaaggtg ^80217 

Query: 137 tctggggaccttgggtccagtgggcctcttgctcccagccctgcggggtgggggtgcagc 196 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I M I I I I I I I 
Sbjct: 80218 tctggggaccttgggtccagtgggcctcttgctcccagccctgcggggtgggggtgcagc 80277 

Query: 197 gcaggagccggacatgtcagctccctacagtgcagctccacccgagtctgcccctccctc 256 

II II III MIIIIIMI III MM III III MM III III MM III III INI I 

Sbjct: 80278 gcaggagccggacatgtcagctccctacagtgcagctccacccgagtctgcccctccctc 80337 

Query: 257 cccggcccccaagacatccagaagccctcctcccccggggccagggtcccagaccccaga 316 

M II I I I I I I I I I M I I I I I I II I I I I II II I I I I I II I I II I II II I I II I I I I II I I I 
Sbjct: 80338 cccggcccccaagacatccagaagccctcctcccccggggccagggtcccagaccccaga 80397 

Query: 317 cttctccagaaaccctccccttgtacaggacacagtctcggggaaggggtggcccacttc 376 

lllllll I Ml Ml lllllll IMIM MM lllllll III Ml MM Ml I II MM 

Sbjct: 80398 cttctccagaaaccctccccttgtacaggacacagtctcggggaaggggtggcccacttc 80457 
Query: 377 gaggtcccgcttcccacctagggagagaggagacccaggagattcgagcggccaggaggt 436 

II MM II ! Ml II ! Ml Ml MM! IMMMIII MM I li Ml II II II Ml II Ml 

Sbjct: 80458 gaggtcccgcttcccacctagggagagaggagacccaggagattcgagcggccaggaggt 80517 



Identities = 252/255 (98%) 
Strand = Plus / Plus 



Query: 2380 cagtgctccgtgcggtgcggccggggccagagaagccggcaggttcgctgtgttgggaac 243 9 

I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Sbjct: 85732 cagtgctccgtgcggtgcggccggggccagagaagccggcaggttcgctgtgttgggaac 85791 

Query: 2440 aacggtgatgaagtgagcgagcaggagtgtgcgtcaggccccccacagccccccagcaga 2499 

II I i I i I I ! Ill I [ I [ I I 1 1 1 I [ 1 1 ! I I I i I I I i J I I I I. Ml II II M I Ml 

Sbjct: 85792 aatggtgatgaagtgagcgagcaggagtgtgcgtcaggccccccgcagccccccagcaga 85851 



Query : 
Sbjct : 



2500 



85852 



gaggcctgtgacatggggccctgtactactgcctggttccacagcgactggagctccaag 2559 

Ml III MM I II Mill I III I III I III I III 1 1 II I Ml MM MM 1 1 II I MM I 

gaggcctgtgacatggggccctgtactactgcctggttccacagcgactggagctccaag 85911 



Query : 
Sbjct : 



2560 



85912 



gtgagcccggaacccccagccatatcctgcatcctgggtaaccatgcccaggacacctca 2619 

II Ml I Ml I Ml MM I III 1 1 II MM I Ml I III I Ml I 1 1 1 1 1 1 1 1 1 M 1 1 1 1 

gtgagcccggaacccccagccatatcctgcatcctgggtaaccacgcccaggacacctca 85971 



Query: 2620 gcctttccagcatag 2634 

I I I I I I I I I II II I I 
Sbjct: 85972 gcctttccagcatag 85986 



Identities = 206/206 (100%) 
Strand = Plus / Plus 



Query: 1371 gagccccggctgtgatgggatccttggctctggcaggcgtcctgatggctgtggagtctg 1430 

MM! MMIMMMMMMMIIMIMMMIMMIIMIIIII IIMIIM 

Sbjct: 83423 gagccccggctgtgatgggatccttggctctggcaggcgtcctgatggctgtggagtctg 83482 

Query: 1431 tgggggtgatgattctacctgtcgccttgtttcggggaacctcactgaccgagggggccc 1490 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 83483 tgggggtgatgattctacctgtcgccttgtttcggggaacctcactgaccgagggggccc 83542 



Query: 
Sbjct : 



1491 



83543 



cctgggctatcagaagatcttgtggattccagcgggagccttgcggctccagattgccca 1550 

I II III I I I I I I I III I I I II I I I I I I I M I I I I I I M I I M M M I II I I I I I I II II I 
cctgggctatcagaagatcttgtggattccagcgggagccttgcggctccagattgccca 83602 



Query: 1551 gctccggcctagctccaactacctgg 1576 

I II 1 1 1 1 1 II II I II 1 1 1 1 M I II 1 1 

Sbjct: 83603 gctccggcctagctccaactacctgg 83628 



Identities = 205/205 (100%) 
Strand = Plus / Plus 



Query: 2178 ctgggaggctggcgagtggacatcctgcagccgctcctgtggccccggcacccagcaccg 2237 

II Illlllllll Mill llllll MUM III II 

Sbjct: 85207 ctgggaggctggcgagtggacatcctgcagccgctcctgtggccccggcacccagcaccg 85266 

Query: 2238 ccagctgcagtgccggcaggaatttggggggggtggctcctcggtgcccccggagcgctg 2297 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Sbjct: 85267 ccagctgcagtgccggcaggaatttggggggggtggctcctcggtgcccccggagcgctg 85326 

Query: 2298 tggacatctcccccggcccaacatcacccagtcttgccagctgcgcctctgtggccattg 23 57 

Ml III III Mill III III IN III II III MM MM III III I II I MM III II 

Sbjct: 85327 tggacatctcccccggcccaacatcacccagtcttgccagctgcgcctctgtggccattg 85386 

Query: 2358 ggaagttggctctccttggagccag 2382 

II I I i I I I I I I II I I I I I I I II I I I 
Sbjct: 85387 ggaagttggctctccttggagccag 85411 



Identities = 190/190 (100%) 
Strand = Plus / Plus 

Query: 1861 gagattctgagggtggagcccccacttgctccggcaccccgcccagcccggaccccaggc 1920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 84411 gagattctgagggtggagcccccacttgctccggcaccccgcccagcccggaccccaggc 8447 0 

Query: 1921 accctccagcgtcaggtgcggatcccccagatgcccgccccgccccatcccaggacaccc 1980 

MIIIIIM 1 1 1 1 ■ 1 1 1 1 Mill 

Sbjct: 84471 accctccagcgtcaggtgcggatcccccagatgcccgccccgccccatcccaggacaccc 84530 
Query: 1981 ctggggtctccagctgcgtactggaaacgagtgggacactctgcatgctcagcgtcctgc 2040 

II MINIM llllll MINIM 1 1 1 1 

Sbjct: 84531 ctggggtctccagctgcgtactggaaacgagtgggacactctgcatgctcagcgtcctgc 84590 

Query: 2041 gggaaaggtg 2050 

Illlllllll 
Sbjct: 84591 gggaaaggtg 84600 



Identities = 174/174 (100%) 
Strand = Plus / Plus 

Query: 1576 gcacttcgtggccctgggggccggtccatcatcaatgggaactgggctgtggatccccct 1635 

II II I II Ml 1 1 1 1 II II I II 1 1 1 1 1 1 1 MM I M M M I M MM I II II II I II 1 1 1 

Sbjct: 83882 gcacttcgtggccctgggggccggtccatcatcaatgggaactgggctgtggatccccct 83941 



Query: 163 6 gggtcctacagggccggcgggaccgtctttcgatataaccgtcctcccagggaggagggc 1695 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I M > 
Sbjct: 83942 gggtcctacagggccggcgggaccgtctttcgatataaccgtcctcccagggaggagggc 84001 

Query: 1696 aaaggggagagtctgtcggctgaaggccccaccacccagcctgtggatgtctat 1749 

I I I I II I I I I I I I I I I I II I II I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I 
Sbjct: 84002 aaaggggagagtctgtcggctgaaggccccaccacccagcctgtggatgtctat 84055 



Identities = 139/139 (100%) 
Strand = Plus / Plus 



Query: 1233 agtccagggctcccagcgctgtgaactgaactgccggccccgtggcttccgcttctatgt 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I 
Sbjct : 82689 agtccagggctcccagcgctgtgaactgaactgccggccccgtggcttccgcttctatgt 



1292 



82748 



Query: 1293 ccgtcacactgaaaaggtccaggatgggaccctgtgtcagcctggagcccctgacatctg 1352 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 82749 ccgtcacactgaaaaggtccaggatgggaccctgtgtcagcctggagcccctgacatctg 82808 



Query: 1353 tgtggctggacgctgtctg 1371 

I I I I I I I I I I I I I I I I I I I 
Sbjct: 82809 tgtggctggacgctgtctg 82827 



Identities = 132/132 (100%) 
Strand = Plus / Plus 



Query: 2046 aggtgtctggcgccccattttcctctgcatctcccgtgagtcgggagaggaactggatga 2105 

1 1 1 II I II 1 1 1 1 1 1 Mill I II 1 1 1 1 1 MM II III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

Sbjct: 84754 aggtgtctggcgccccattttcctctgcatctcccgtgagtcgggagaggaactggatga 84813 

/ 

Query: 2106 acgcagctgtgccgcgggtgccaggcccccagcctcccctgaaccctgccacggcacccc 2165 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 84814 acgcagctgtgccgcgggtgccaggcccccagcctcccctgaaccctgccacggcacccc 84873 



Query: 2166 atgccccccata 2177 

lililillliii 

Sbjct: 84874 atgccccccata 84885 



Identities = 112/112 (100%) 
Strand = Plus / Plus 



Query: 1750 atgatctttcaggaggaaaacccaggcgttttttatcagtatgtcatctcttcacctcct 1809 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 

Sbjct: 84201 atgatctttcaggaggaaaacccaggcgttttttatcagtatgtcatctcttcacctcct 84260 
Query: 1810 ccaatccttgagaaccccaccccagagccccctgtcccccagcttcagccgg 1861 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Sbjct: 84261 ccaatccttgagaaccccaccccagagccccctgtcccccagcttcagccgg 84312 



Identities = 104/104 (100%) 
Strand = Plus / Plus 

Query: 1131 gccctgcccccctgagcagccagacccccgggccctgcagtgcgcagcctttaactccca 1190 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Sbjct: 81626 gccctgcccccctgagcagccagacccccgggccctgcagtgcgcagcctttaactccca 81685 

Query: 1191 ggaattcatgggccagctgtatcagtgggagcccttcactgaag 1234 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiniiiiiiiiiiiii 

Sbjct: 81686 ggaattcatgggccagctgtatcagtgggagcccttcactgaag 81729 



Identities = 65/65 (100%) 
Strand = Plus / Plus 

Query : 17 gcaggccctggc tgtatctgctgc tgct tctgtccctccctcagctc tgc t tggatcagg 7 6 

II II I MM III III MM I II II I INI I MM I II IN 1 1 II 1 1 1 II Ml II I INI 

Sbjct: 79801 gcaggccctggctgtatctgctgctgcttctgtccctccctcagctctgcttggatcagg 79860 

Query: 77 aggtg 81 
I I I I I 

Sbjct: 79861 aggtg 79865 



Identities = 21/21 (100%) 
Strand = Plus / Plus 



Query: 1 atggagaactggactggcagg 21 

■ II II II Mill I 

Sbjct: 79551 atggagaactggactggcagg 79571 
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COMMENT 



AL356356 176550 bp DNA linear PRI 30-JAN-2002 

Human DNA sequence from clone RP11-54A4 on chromosome 1, complete 
sequence . 
AL356356 

AL356356.17 GI : 18476570 
HTG . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 176550) 
Heath, P. 

Direct Submission 

Submitted (30-JAN-2002 ) Wellcome Trust Sanger Institute, Hinxton, 
Cambridgeshire, CB10 ISA, UK. E-mail enquiries: 

humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk 
On Feb 1, 2002 this sequence version replaced gi : 18250748 . 
During sequence assembly data is compared from overlapping clones. 
Where differences are found these are annotated as variations 
together with a note of the overlapping clone name. Note that the 
variation annotation may not be found in the sequence submission 
corresponding to the overlapping clone, as we submit sequences with 
only a small overlap as described above. 

This sequence was finished as follows unless otherwise noted: all 
regions were either double -stranded or sequenced with an alternate 
chemistry or covered by high quality data (i.e., phred quality >= 
30); an attempt was made to resolve all sequencing problems, such 
as compressions and repeats; all regions were covered by at least 
one plasmid subclone or more than one M13 subclone; and the 
assembly was confirmed by restriction digest. The following 
abbreviations are used to associate primary accession numbers given 
in the feature table with their source databases: Em:, EMBL; Sw: , 
SWISSPROT; Tr : , TREMBL; Wp : , WORMPEP; Information on the WORMPEP 
database can be found at 

http: //www. Sanger .ac . uk/Pro j ects/C„elegans/wormpep This sequence 
was generated from part of bacterial clone contigs of human 
chromosome 1, constructed by the Sanger Centre Chromosome 1 Mapping 
Group. Further information can be found at 
http: //www. sanger .ac .uk/HGP/Chrl 

RP11-54A4 is from the library RPCI-11.1 constructed by the group of 
Pieter de Jong. For further details see 
http : / /www . chor i . org/bacpac /home . htm 
VECTOR: pBACe3 . 6 

IMPORTANT: This sequence is not the entire insert of clone 
RP11-54A4 It may be shorter because we sequence overlapping 
sections only once, except for a short overlap. 

The true left end of clone RP11-54A4 is at 1 in this sequence. The 



http://www.ncbi.nlm.nih.gov/entrez/query.fcgi ?cmd=Retrieve&db=nucleotide&list_uids=184'; 7/7/2003 



>NM_019032 ACCESSION:NM_019032 NID: gi 24308134 ref NM_019032.1 

Homo sapiens thrombospondin repeat containing 1 (TSRC1) , 
mRNA 
Length = 2704 

Score = 950 bits (2428), Expect = 0.0 

Identities = 431/433 (99%), Positives = 431/433 (99%) 

Frame = +1 

Query: 425 RGFRFYVRHTEKVQDGTLCQPGAPDICVAGRCLSPGCDGILGSGRRPDGCGVCGGDDSTC 484 

RGFRFYVRHTEKVQDGTLCQPGAPDICVAGRCLSPGCDGILGSGRRPDGCGVCGGDDSTC 
Sbjct: 1 RGFRFYVRHTEKVQDGTLCQPGAPDICVAGRCLSPGCDGILGSGRRPDGCGVCGGDDSTC 180 

Query: 485 RLVSGNLTDRGGPLGYQKILWIPAGALRLQIAQLRPSSNYLALRGPGGRSIING3SfWAVT)P 544 

RLVSGNLTDRGGPLGYQKILWIPAGALRLQIAQLRPSSNYLALRGPGGRSIINGNWAVDP 
Sbjct: 181 RLVSGNLTDRGGPLGYQKILWIPAGALRLQIAQLRPSSNYLALRGPGGRSIINGNWAVDP 360 

Query: 545 PGSYRAGGTVFRYNRPPREEGKGESLSAEGPTTQPVDVYMIFQEENPGVFYQYVISSPPP 604 

PGSYRAGGTVFRYNRPPREEGKGESLSAEGPTTQPVDVYMIFQEENPGVFYQYVISSPPP 
Sbjct: 361 PGSYRAGGTVFRYNRPPREEGKGESLSAEGPTTQPVDVYMIFQEENPGVFYQYVISSPPP 540 

Query: 605 ILENPTPEPPVPQLQPEILRVEPPLAPAPRPARTPGTLQRQVRIPQMPAPPHPRTPLGSP 664 

ILENPTPEPPVPQLQPEILRVEPPLAPAPRPARTPGTLQRQVRIPQMPAPPHPRTPLGSP 
Sbjct: 541 ILENPTPEPPVPQLQPEILRVEPPLAPAPRPARTPGTLQRQVRIPQMPAPPHPRTPLGSP 720 

Query: 665 AAYWKRVGHS ACS ASCGKGVWRPI FLC I SRESGEELDERSC AAGARPPAS PEPCHGTPC P 724 

AAYWKRVGHSACSASCGKGVWRPIFLCISRESGEELDERSCAAGARPPASPEPCHGTPCP 
Sbjct: 721 AAYWKRVGHSACSASCGKGVWRPI FLC I SRESGEELDERSCAAGARPPAS PEPCHGTPC P 900 

Query: 725 PYWEAGEWTSCSRSCGPGTQHRQLQCRQEFGGGGSSVPPERCGHLPRPNITQSCQLRLCG 784 

PYWEAGEWTSCSRSCGPGTQHRQLQCRQEFGGGGSSVPPERCGHLPRPNITQSCQLRLCG 
Sbjct: 901 PYWEAGEWTSCSRSCGPGTQHRQLQCRQEFGGGGSSVPPERCGHLPRPNITQSCQLRLCG 1080 

Query: 785 HWEVGSPWSQCSVRCGRGQRSRQVRCVGNNGDEVSEQECASGPPQPPSREACDMGPCTTA 844 

HWEVGSPWSQCSVRCGRGQRSRQVRCVGNNGDEVSEQECASGPPQPPSREACDMGPCTTA 
Sbjct : 1081HWEVGSPWSQCSVRCGRGQRSRQVRCVGNNGDEVSEQECASGPPQPPSREACDMGPCTTA 1260 

Query: 845 WFHSDWSSKVSPE 857 

WFHSDWSSK S E 
Sbjct: 1261WFHSDWSSKCSAE 1299 
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C 1: NM_019032[gi:24308134] This record was temporarily removed by RefSeq staff for additional 
review. 
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NM_019032 2704 bp mRNA linear PRI 20-APR-2003 

Homo sapiens thrombospondin repeat containing 1 (TSRC1) , mRNA. 
NM_019032 

NM_019032.1 GI: 24308134 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 2704) 
Strausberg, R. 

Homo sapiens, hyothetical protein, clone MGC: 34896 IMAGE : 5001173 , 
mRNA 

Unpublished (2002) 

PROVISIONAL REFSEQ : This record has not yet been subject to final 
NCBI review. The reference sequence was derived from BC027478 . 1 . 

Location/Qualifiers 

1. .2704 

/organism="Homo sapiens" 
/mol_type=="mRNA" c 
/db_xref = " taxon : 9 606 " 
/chromosome= " 1 " 
/map="lq21.2" 

/clone= "MGC : 34896 IMAGE : 5001173 " 

/tissue_type=" Placenta, choriocarcinoma" 

/ c 1 one_l ib= " NI H_MGC_1 0 " 

/lab_host="DH10B" 

/note= "Vector: pCMV-SPORT6 " 

1. .2704 

/gene="TSRCl" 

/note=" synonym: DKFZP434K1772 " 

/db_xref="LocusID: 54507 " 

679. .1953 

/gene="TSRCl" 

/codon_start=l 

/product=" thrombospondin repeat containing 1" 
/protein_id= " NP_061905 . 1 " 
/db_xref="GI: 24308135" 
/db_xref = "Locus ID : 54507 " 

/ translation= "MPAPPHPRTPLGSPAAYWKRVGHSACSASCGKGVWRPIFLCISR 
ESGEELDERSCAAGARPPASPEPCHGTPCPPYWEAGEWTSCSRSCGPGTQHRQLQCRQ 
EFGGGGSSVPPERCGHLPRPNITQSCQLRLCGHWEVGSPWSQCSVRCGRGQRSRQVRC 
VGNNGDEVSEQECASGPPQPPSREACDMGPCTTAWFHSDWSSKCSAECGTGIQRRSW 
CLGSGAALGPGQGEAGAGTGQSCPTGSRPPDMRACSLGPCERTWRWYTGPWGECSSEC 
GSGTQRRDIICVSKLGTEFNVTSPSNCSHLPRPPALQPCQGQACQDRWFSTPWSPCSR 
SCQGGTQTREVQCLSTNQTLSTRCPPQLRPSRKRPCNSQPCSQRPDDQCKDSSPHCPL 
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WQARLCVYPYYTATCCRSCAHVLERSPQDPS " 
variation 1604 

/gene="TSRCl" 

/allele= B T" 

/allele="C" 

/ db^xr e f = " dbSNP : 1052180 " 
variation complement (2167 ) 

" /allele= ,, C" 
/allele= n A n 

/ db_xr e f = " dbSNP : 988767 " 
BASE COUNT 510 a 873 c 774 g 547 t 

ORIGIN 

1 cgtggcttcc gcttctatgt ccgtcacact gaaaaggtcc aggatgggac cctgtgtcag 

61 cctggagccc ctgacatctg tgtggctgga cgctgtctga gccccggctg tgatgggatc 

121 cttggctctg gcaggcgtcc tgatggctgt ggagtctgtg ggggtgatga ttctacctgt 

181 cgccttgttt cggggaacct cactgaccga gggggccccc tgggctatca gaagatcttg 

241 tggattccag cgggagcctt gcggctccag attgcccagc tccggcctag ctccaactac 

301 ctggcacttc gtggccctgg gggccggtcc atcatcaatg ggaactgggc tgtggatccc 

361 cctgggtcct acagggccgg cgggaccgtc tttcgatata accgtcctcc cagggaggag 

421 ggcaaagggg agagtctgtc ggctgaaggc cccaccaccc agcctgtgga tgtctatatg 

481 atctttcagg aggaaaaccc aggcgttttt tatcagtatg tcatctcttc acctcctcca 

541 atccttgaga accccacccc agagccccct gtcccccagc ttcagccgga gattctgagg 

601 gtggagcccc cacttgctcc ggcaccccgc ccagcccgga ccccaggcac cctccagcgt 

661 caggtgcgga tcccccagat gcccgccccg ccccatccca ggacacccct ggggtctcca 

721 gctgcgtact ggaaacgagt gggacactct gcatgctcag cgtcctgcgg gaaaggtgtc 

781 tggcgcccca ttttcctctg catctcccgt gagtcgggag aggaactgga tgaacgcagc 

841 tgtgccgcgg gtgccaggcc cccagcctcc cctgaaccct gccacggcac cccatgcccc 

901 ccatactggg aggctggcga gtggacatcc tgcagccgct cctgtggccc cggcacccag 

961 caccgccagc tgcagtgccg gcaggaattt ggggggggtg gctcctcggt gcccccggag 

1021 cgctgtggac atctcccccg gcccaacatc acccagtctt gccagctgcg cctctgtggc 

1081 cattgggaag ttggctctcc ttggagccag tgctccgtgc ggtgcggccg gggccagaga 

1141 agccggcagg ttcgctgtgt tgggaacaac ggtgatgaag tgagcgagca ggagtgtgcg 

1201 tcaggccccc cgcagccccc cagcagagag goctgtgaca tggggccctg tactactgcc 

1261 tggttccaca gcgactggag ctccaagtgc tcagccgagt gtgggacggg aatccagcgg 

1321 cgctctgtgg tctgccttgg gagtggggca gccctcgggc caggccaggg ggaagcagga 

1381 gcaggaactg ggcagagctg tccaacagga agccggcccc ctgacatgcg cgcctgcagc 

1441 ctggggccct gtgagagaac ttggcgctgg tacacagggc cctggggtga gtgctcctcc 

1501 gaatgtggct ctggcacaca gcgtagagac atcatctgtg tatccaaact ggggacggag 

1561 ttcaacgtga cttctccgag caactgttct cacctcccca ggccccctgc cctgcagccc 

1621 tgtcaagggc aggcctgcca ggaccgatgg ttttccacgc cctggagccc atgttctcgc 

1681 tcctgccaag ggggaacgca gacacgggag gtccagtgcc tgagcaccaa ccagaccctc 

1741 agcacccgat gccctcctca actgcggccc tccaggaagc gcccctgtaa cagccaaccc 

1801 tgcagccagc gccctgatga tcaatgcaag gacagctctc cacattgccc cctggtggta 

1861 caggcccggc tctgcgtcta cccctactac acagccacct gttgccgctc ttgcgcacat 

1921 gtcctggagc ggtctcccca ggatccctcc tgaaaggggt ccggggcacc ttcacggttt 

1981 tctgtgccac catcggtcac ccattgatcg gcccactctg aaccccctgg ctctccagcc 

2041 tgtcccagtc tcagcaggga tgtcctccag gtgacagagg gtggcaaggt gactgacaca 

2101 aagtgacttt cagggctgtg gtcaggccca tgtggtggtg tgatgggtgt gtgcacatat 

2161 gcctcaggtg tgcttttggg actgcatgga tatgtgtgtg ctcaaacgtg tatcactttt 

2221 caaaaagagg ttacacagac tgagaaggac aagacctgtt tccttgagac tttcctaggt 

2281 ggaaaggaaa gcaagtctgc agttccttgc taatctgagc tacttagagt gtggtctccc 

2341 caccaactcc agttttgtgc cctaagcctc atttctcatg ttcagacctc acatcttcta 

2401 agccgccctg tgtctctgac cccttctcat ttgcctagta tctctgcccc tgcctcccta 

2461 attagctagg gctggggtca gccactgcca atcctgcctt actcaggaag gcaggaggaa 

2521 agagactgcc tctccagagc aaggcccagc tgggcagagg gtgaaaaaga gaaatgtgag 

2581 catccgctac cccaccaccc cgcccagccc ctagccccac tccctgcctc ctgaaatggt 

2641 tcccacccag aactaattta ttttttatta aagatggtca tgacaaatga aaaaaaaaaa 
2701 aaaa 

// 
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>AY158701 ACCESSION: AY158701 NID: gi 30024980 gb AY158701.1 Mus 
musculus thrombospondin repeat protein 1 (Tsrcl) mRNA, 
complete cds 
Length = 3856 

Score = 1322 bits (3383), Expect = 0.0 

Identities = 640/860 (74%), Positives = 686/860 (79%), Gaps = 3/860 (0%) 
Frame = +\ 

Query: 1 MENWTGRPWLYLLLLLSLPQLCLDQEVLSGHSLQTPTEEGQGPEGVWGPWVQWASCSQPC 60 

ME+W GR WL ++LLL LPQ C DQE L G S Q P+EEGQ PEG+WGPW +WASCSQPC 
Sbjct: 117 MESWLGRLWLCMMLLLPLPQPCQDQE-LFGPSHQLPSEEGQVPEGLWGPWGRWASCSQPC 293 

Query: 61 GVGVQRRSRTCQLPTVQLHPSLPLPPRPPRHPEALLPRGQGPRPQTSPETLPLYRTQSRG 120 

GVGVQRR S RTC + L HP+LPLPPRPPRHPEA PRGQG RPQT + LYR Q RG 

Sbjct: 294 GVGVQRRS RTC EL HPALPLPPRPPRHPEAHRPRGQGSRPQTPRDPQSLYRPQPRG 458 

Query: 121 RGGPLRGPASHLGREETQE I RAARRSRLRDP IKPGMFGYGRVPFALPLHRNRRHPRS PPR 180 

RGGPLR PAS +GREETQE + A+R R+RDPIKPGMFGYGRVPFALPLHR+RRHP P + 
Sbjct: 459 RGGPLRAPASQVGREETQEPQGAQRFRVRDPIKPGMFGYGRVPFALPLHRSRRHPHRPGQ 638 

Query: 181 SELSLISSRGEEAIPSPTPRAEPFSANGSPQTELP-PTELSVHTPSPQAEPLSPETAQTE 239 

+ S S GE +PS PES P+PP STP TAQTE 
Sbjct: 639 PKNS STGEGMVPSQPPSTELASEKHGPHMQPPEPRSHSAETPRSG TAQTE. 788 

Query: 240 VAPRTRPAPLRHHPRAQASGTEPPSPTHSLGEGGFFRASPQPRRPSSQG-WASPQVAGRR 298 

V PRT AP ST P+PT S G+ F+ S PR P S G W+SPQ A RR 

Sbjct: 789 VLPRTSSAP SYTGTPAPTSSFGDSRSFQGSLGPRMPPSPGSWSSPQGAERR 941 

Query: 299 -PDPFPSVPRGRGQQGQGPWGTGGTPHGPRLEPDPQHPGAWLPLLSNGPHASSLWSLFAP 357 

P PF VPR QQ + W P GP PD WLPL + +S LWS+FAP 
Sbjct: 942 HPPPFSPVPRS--QQSRRHW RPPGPHRSPD GWLPLTRD SSPLWSIFAP- 1079 

Query: 358 SSPIPRCSGESEQLRACSQAPCPPEQPDPRALQCAAFNSQEFMGQLYQWEPFTEVQGSQR 417 

S P P CSGESEQ+RACSQ PCPPEQPDPRALQCAAF+SQEFMGQLYQWEPFTEVQGSQR 
Sbjct : 108 0 SI PAPNCSGESEQMRACSQEPC PPEQPDPRALQCAAFDSQEFMGQLYQWEPFTEVQGSQR 1259 

Query: 418 CELNCRPRGFRFYVRHTEKVQDGTLCQPGAPDICVAGRCLSPGCDGILGSGRRPDGCGVC 477 

CELNCRPRGFRFYVRHTEKVQDGTLCQPG+ DICVAGRCLSPGCDG+LGSGRRPDGCGVC 
Sbjct : 1260CELNCRPRGFRFYVRHTEKVQDGTLCQPGSLDICVAGRCLSPGCDGVXGSGRRPDGCGVC 1439 

Query: 478 GGDDSTCRLVSGNLTDRGGPLGYQKILWIPAGALRLQIAQLRPSSNYLALRGPGGRSIIN 537 

GGD STCRLVSGNLTDRGGPLGYQKILWI PAGA L I+QLRPSSNYLALRGPGGRSIIN 
Sbjct : 1440GGDGSTCRLVSGNLTDRGGPLGYQKILWIPAGASHLHISQLRPSSNYLALRGPGGRSIIN 1619 

Query: 538 GNWAVDPPGSYRAGGTVFRYNRPPREEGKGESLSAEGPTTQPVDVYMIFQEENPGVFYQY 597 

GNWAVDPPGSY A GTVF+YNRPPREEGKGESLSAEGPTTQPVDVYMIFQE+NPGVFYQY 
Sbjct : 1620GNWAVTDPPGSYTAIGTVFQYNRPPREEGKGESLSAEGPTTQPV1DWM 1799 

Query: 598 VISSPPPILENPTPEPPVPQLQPEILRVEPPLAPAPRPARTPGTLQRQVRIPQMPAPPHP 657 

VISSPP +LE+P+ +PP Q QPE+LR EP L APRP R PGTLQRQVRI PQ+ P P 
Sbjct : 1800VISSPPAVLESPSTKPPALQPQPEMLRGEPLLPSAPRPVRAPGTLQRQVRIPQVPPPTRV 1979 



Query: 658 RTPLGSPAAYWKRVGHSACSASCGKGVWRPIFLCISRESGEELDERSCAAGARPPASPEP 717 

RT +GS A YWK+VGHS CSASCGKGVW PIFLCISRESGEELDE+SCA GARPPASPEP 
Sbjct : 1980 RTAMGS S AGYWKQVGHSEC S ASCGKGVWHPI FLC I SRESGEELDEQSC AVGARPPAS PEP y 2 1 5 9 



Query: 718 CHGTPCPPYWEAGEWTSCSRSCGPGTQHRQLQCRQEFGGGGSSVPPERCGHLPRPNITQS 777 

CHG PCPPYWEAGEWTSCSRSCGPGTQHRQL CRQEFGGGGSSVPPERCGHLPRPNITQ 
Sbjct : 2160CHGPPCPPYWEAGEWTSCSRSCGPGTQHRQLLCRQEFGGGGSSVPPERCGHLPRPNITQP 233 9 

Query: 778 CQLRLCGFiWEVGSPWSQCSVRCGRGQRSRQVRCVG3SnSTGDEVSEQECASGPPQPPSREACD 837 

CQL LCGHWE+ SPWSQCSVRCGRGQRSRQVRCVG+NGDEV +QECASGPP PPSREACD 
Sbjct : 2340CQLHLCGHWEISSPWSQCSVRCGRGQRSRQVRCVGSNGDEVDKQECASGPPPPPSREACD 2519 

Query: 838 MGPCTTAWFHSDWSSKVSPE 857 

MGPCTTAWF+SDWSSK S E 
Sbjct: 2520MGPCTTAWFYSDWSSKCSAE 2579 
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AY158701 3856 bp mRNA linear ROD 19-APR-2003 

Mus musculus thrombospondin repeat protein 1 (Tsrcl) mRNA, complete 
cds . 

AY158701 

AY158701.1 GI:30024980 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 3856) 
Buchner,D.A. and Meisler,M.H. 

TSRC1, a widely expressed gene containing seven thrombospondin type 

1 repeats 

Gene 307, 23-30 (2003) 

2 (bases 1 to 3856) 
Buchner,D.A. and Meisler,M.H. 
Direct Submission 

Submitted ( 04-OCT-2002 ) Human Genetics, University of Michigan, 
4909 Buhl, Ann Arbor, MI 48109-0618, USA 

Location/Qualifiers 

1. .3856 

/organism="Mus musculus" 

/mol_type= " mRNA " 

/strain="C57BL/6J" 

/ db_xr e f = " t axon : 1 0 0 9 0 " 

/chromosome= " 3 " 

/map="F2" 

1. .3856 

/gene=" Tsrcl" 

/note= 11 thrombospondin repeat containing 1" 
117. .3227 
/gene=" Tsrcl" 
/codon_start=l 

/product=" thrombospondin repeat protein 1" 
/protein_id= " AA017738.1 " 
/db_xref="GI:30024981" 

/translations 11 MESWLGRLWLCMMLLLPLPQPCQDQELFGPSHQLPSEEGQVPEG, 
LWG PWGRWASjc S QPC GVGVQfRRS RTC EIj^I ^ AL PL P PR|^J3jHP EAHR PRGQG S R PQT PR^o 

dpqslyrpqprgrggplrapasqvgreetqepqgaqrfrvrdpikpgmfgVgrvpfal 
plhrsrrhphrpgqpknsstgegmvpsqppstelasekhgphmqgbepbshsaetprs 
gtaqtevlprtssapsytgtpaptssfgdsrsfqgslgprmppspgswsspqgaerrh 
pppfspvprsqqsrrhwrppgphrspdgwlpltrdssplwsifapsipapncsgeseq 
mracsqepcppeqpdpralqcaafdsqefmgqlyqwepftevqgsqrcelncrprgfr 
fyvrhtekvqdgtlcqpgsldicvagrclspgcdgvlgsgrrpdgcgvcggdgstcrl 
vsgnltdrggplgyqkilwipagashlhisqlrpssnylalrgpggrsiingnwavdp 
pgsytaigtvfqynr^reegkgeslsaegpttqpvdvymifqednpgvfyqyvissp 



http://www.ncbi.nlm.nih.gov/entrez/query.fcgi ?cmd=Retrieve&db=nucleotide&list_uids=300: 7/6/2003 
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PAVLE S P STKP P ALQ PQ P EML RGE PLL PS APRPVRAPGTLQRQVRI PQVP P PTRVRTA 
MGSSAGYWKQVGHSECSASCGKGVWHPIFLCISRESGEELDEQSCAVGARPPASPEPC' 
HGPPCPPYWEAGEWTSCSRSCGPGTQHRQLLCRQEFGGGGSSVPPERCGHLPRPNITQ 
PCQLHLCGHWEISSPWSQCSVRCGRGQRSRQVRCVGSNGDEVDKQECASGPPPPPSRE 
• ACDMGPCTTAWFYSDWSS^CS^jpGTGIQRRAWCLRSGETLQGDPEAGSTEQGGPLR 
SRPPDMRACSLGPCERTWRWFTGPWSECSSECGSGtQHRDIICVSKLGAEFNVTSPSN 
CSHLPRPPALQPCQGQACEDKWFSTLWSPCSRSCQGGMQTREVQCLSGNQTLSSRCPP 
HLRPSRKRPCNSQPCNQRPDDQCKDSSPHCPLWQARLCVYPYYTTTCCRSCAHVLEQ <n£? 
SQLEPA " ^0 
BASE COUNT 763 a 1232 c 1069 g 792 t ^ ; 

ORIGIN ^ 
1 actaccagag gcagctggag agagcgggcg cagagaagag caccctccaa gccccgatgc ~ 
61 cagagtagcc accctgtccc ctgctcccca gcccaggggc agtcacagga agagcaatgg " 
121 aatcttggct gggcaggctc tggctgtgta tgatgctgct cctgcctctc cctcagccct A&O' 
181 gccaggatca ggagctgttt ggaccttctc atcagctgcc atcagaggag ggccaggtcc y§ 
241 ctgaaggcct ctggggacct tggggtcgat gggcttcctg ctcccagccc tgtggggtgg 
301 gggtgcagcg caggagccga acatgtgaac tgcacccagc cctacccctc cctccccgac 
361 ccccaagaca tccagaagcc caccggcccc ggggccaggg ctccagaccc cagactccac 
421 gggatcccca gtctctgtat aggccacagc ctcggggaag gggtggccct cttcgagctc 
481 ctgcttctca agtggggaga gaggaaaccc aggagcctca aggggcccag aggttccggg 
541 ttcgagaccc catcaagcca gggatgttcg gttatgggag agtgcctttt gccttgcctc 
601 tgcaccggag ccgcaggcat cctcacagac ccgggcaacc caagaactct tctacagggg 
661 aagggatggt tccatctcag cctccaagca cagagctggc ctctgaaaag cacggccctc 
721 acatgcagcc ccctgaaccc cgttctcact cagctgaaac tccaaggtct ggaactgctc 
781 agacagaagt gctccccaga actagttctg ctccttccta cacagggacc cctgcaccta 
841 cctcctcttt tggagacagt aggtctttcc agggatccct tgggccacga atgccacctt 
901 ccccgggaag ttggagcagt ccccagggag cagagcgacg acacccacct cctttctctc 
961 ctgtcccccg gagccagcag agcaggaggc actggagacc tccaggtcct cacaggtccc 
1021 cagatggctg gctgcccctg acaagagact ccagcccact ctggagcatc tttgctccca 
1081 gtatccctgc tccaaactgt tctggggaaa gtgaacagat gcgagcctgc agccaggagc 
1141 cttgcccccc tgagcagcca gaccccaggg ccctacagtg tgccgccttt gactcccagg 
1201 aattcatggg ccagctgtac cagtgggagc ccttcaccga agttcagggc tcccagcgct 
1261 gtgaactgaa ctgccgcccc cgtggcttcc gattctatgt ccgacacact gaaaaggtgc 
1321 aggatggaac cctgtgtcag cctggatccc tagacatctg tgtggctgga cgctgcctga 
1381 gccctggctg tgatggggtc cttggctctg gcaggcgtcc ggatggctgc ggagtctgcg 
1441 ggggtgatgg ttctacctgc cgcctggttt cgggaaatct cactgaccga gggggcccct 
1501 tgggctatca gaagatcctg tggatccctg ccggggcctc ccaccttcac atttcccagt 
1561 tgcgacccag ttccaattac ctcgcactcc gagggcctgg gggccgctcc attatcaatg 
1621 ggaactgggc tgtggatcct ccagggtcct atacagccat tgggactgtc ttccagtata 
1681 accgtcctcc acgggaagaa ggcaaggggg agagtctgtc agcagaaggc cctaccactc 
1741 agcctgtgga cgtctatatg atctttcaag aggacaaccc aggtgttttt tatcagtatg 
1801 tcatctcttc ccctcctgca gtcctagaga gtccttccac aaagcctcca gcccttcagc 
1861 ctcagcctga gatgctgagg ggggagcccc tactcccctc agccccccgc ccagtccggg 
1921 caccaggcac cctccaacgt caggtgcgga ttccccaagt gcctcctccg actcgtgtca 
1981 ggacagccat ggggtcttct gctggatact ggaaacaagt ggggcactct gagtgttcag 
2041 catcctgtgg caaaggtgtt tggcacccca ttttcctctg catttcccgt gagtcaggag 
2101 aggagttgga tgaacagagc tgtgctgtgg gggccagacc cccagcttcc cctgaaccct 
2161 gccatggacc cccgtgtcct ccatactggg aggctggcga gtggacatcc tgcagccgat 
2221 cctgtggccc tggcacccag caccgccagc tactctgcag acaggaattc ggaggtggtg 
2281 gctcctcggt acctccagag cgctgtggac atctcccccg gcccaacatc acccagcctt 
2341 gtcagctgca cctctgtggc cactgggaga ttagctcccc ctggagccag tgctctgtgc 
2401 gctgtggtcg tggtcagagg agccggcaag ttcggtgtgt tggcagtaat ggtgatgagg 
2461 tggacaagca ggagtgtgct tcagggcccc cgccgccccc cagcagagag gcctgtgaca 
2521 tgggcccctg caccacagcc tggttctaca gtgactggag ttccaagtgc tcagccgagt 
2581 gcgggacagg aatccagaga cgcgctgtgg tctgccttag gagtggggag acccttcagg 
2641 gagacccgga agcaggaagc actgagcagg gctgccctct cagaagccgc cctcccgata 
2701 tgcgtgcctg cagtttaggg ccctgtgaga ggacatggcg ctggttcaca ggcccctgga 
2761 gtgagtgttc ctcggagtgt gggtctggca cacaacacag agacattatc tgtgtgtcca 
2821 agctgggggc cgagttcaat gtgacttctc ccagcaactg ttcccaccta cccaggccgc 
2881 ctgccctgca gccctgtcag gggcaggcct gtgaggacaa atggttctcc actctctgga 
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2941 gtccgtgttc tcgatcctgc cagggaggca 

3001 gcaaccagac tctcagctcc cgatgccctc 

3061 gcaacagcca accctgcaac cagcgaccag 

3121 gccccctggt ggtacaggcc cggctctgcg 

3181 gctcttgtgc ccatgtcctg gagcagtccc 

3241 gaccctgcct tcgtggtttt cttgtgccac 

3301 cttttcacct ggtctcagcc ttgccagagg 

3361 atagctcaag aagtgtacct gacctgcctg 

3421 cacaaaagca gcttgagaaa gaataggaag 

3481 tgggctgggg agaaaggggg agagagcaag 

3541 agtgtgtgtc ccccagaagt tccaggtctg 

3601 attaccctgt tgttcccatc cctgccttat 

3661 gcaggggcag ggatcagcca ctgctaccct 

3721 gggcagggac cacttgtgtc gagggtgaaa 

3781 cccagtcgcc acttcacact gaaaatgatg 

3841 aagttggttt tgacat 



tgcagacgag ggaagtccag tgcctgagtg 
ctcacctgag accttccagg aagcggccct 
atgaccaatg caaggacagc tctccacact 
tctaccccta ctacacaact acctgctgcc 
agctggaacc ggcttgaaac gggtcccata 
ccactgacta gcccgctctg taccctctgg 
gatcctccag ggtgagagac ctacagtgtg 
ggggtctgtc ctgtgtatgt ttatcacttc 
tgggcggtgc ctgtatccct catgcttttc 
gttatatttc cacactaatc tgagctacat 
tgccccaatc ctcgtctcca catcttctaa 
ctagtctctg tattcctacc tccctcatta 
accttactgg ggaaggacca cacctctcca 
aagagaaacg tcagcataag acatcctagc 
cccaccccag aagtaattta ttttttatta 



Disciaimer | Write to the Help Desk 
NCBI 1 NLM | N1H 



Jun 19 2003 12:37:45 



http://www.ncbi.nlm.nih.gov/entrez/query.fcgi ?cmd=Retrieve&db=nucleotide&list_uids=30L. 7/6/2003 



Entrez-PubMed 



Page 1 of 2 




Pub 




PubMed 



Nucleotide 



Protein 



Genome 



Structure 



PMC 



National 
Library 
of Medicine 

Taxonomy 




OMIM 



Be 



Search | PubMed 



Ml forj 
Limits 



Go 



..Clear 



Preview/Index 



History 



Clipboard 



Details 



About Entrez 



1 



Display Abstract 



lUl Show: 



Text Version 

Entrez PubMed 
Overview 
Help | FAQ 
Tutorial 

New/Noteworthy 
E-Utilittes 

PubMed Services 
Journals Database 
MeSH Database 
Single Citation Matcher 
Batch Citation Matcher 
Clinical Queries 
LinkOut 
Cubby 

Related Resources 
Order Documents 
NLM Gateway 
TOXNET 
Consumer Health 
Clinical Alerts 
ClinicalTrials.gov 
PubMed Central 

Privacy Policy 



□ 1: Gene. 2003 Mar 27;307:23-30. 



FUit-TEXT ARTICLE 



20 - Sort 



to ki 

a .. 



| Send to next 



Related Articles, Li 



TSRC1, a widely expressed gene containing seven 
thrombospondin type I repeats. 

Buchner DA, Meisler MH. 

Department of Human Genetics, University of Michigan School of Medicine 
Ann Arbor, MI 48109-0618, USA. 

The thrombospondin type 1 repeat domain is found in nearly 100 mammaliai 
proteins with diverse biological functions that include cellular adhesion, 
angiogenesis, and patterning of the developing nervous system. We have 
characterized a novel thrombospondin type 1 repeat containing gene, TSRC1 
encoding a predicted protein with seven thrombospondin repeats, six of whic 
are clustered at the C-terminus. The 17 coding exons and two nontranslated 
exons of TSRC1 span 10 kb of genomic DNA. The human and mouse genes 
encode proteins of 1074 and 1036 amino acids, respectively, with 76% amine 
acid sequence identity. Thirty of the extra amino acids in the human protein ; 
encoded by exon 6. Mouse Tsrcl is expressed in all fetal and adult tissues 
tested. Three conserved noncoding sequence elements with potential regulate 
function are located in intron 1. Mouse Tsrcl was genetically mapped to 
chromosome 3 within the nonrecombinant region for the sodium channel 
modifier locus Scnml. The sensitive and resistant alleles of Scnml did not 
differ in Tsrcl protein sequence, transcript length, or transcript abundance. 
Human TSRC1 is located on chromosome lq21 within an 11.7 Mb segment < 
conserved synteny. TSRC1 and the closely linked gene AD AMI 5 appear to 1 
derived by a chromosomal inversion that interrupted an ancestral ADAMTS 
gene. 

PMID: 12706885 [PubMed - indexed for MEDLINE] 
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